Corpus: hun-sk_web_2016_30K

4.4.1.5 Number of Word-N-grams at Sentence Endings

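Number of distinct word N-grams (N = 1 to 5) found at sentence endings within the first K sentences. The rows for K = 100000 and K = 1000000 are identical, presumably because the corpus (30K, per its name) contains fewer than 100000 sentences.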

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              94            97             97            97            97
1000            836           981            990           991           992
10000          6478          9472           9858          9921          9938
100000        16735         27403          29207         29552         29617
1000000       16735         27403          29207         29552         29617
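
A table of this kind could be produced roughly as follows. This is only a minimal sketch, not the method actually used for this corpus: the function name `ending_ngram_counts`, whitespace tokenization, and the rule that sentences shorter than N tokens contribute no N-gram are all assumptions made for illustration.

```python
def ending_ngram_counts(sentences, ks=(100, 1000, 10000), max_n=5):
    """Count distinct sentence-final word N-grams among the first K sentences.

    Assumptions (not taken from the corpus documentation):
    - tokens are obtained by splitting on whitespace;
    - a sentence with fewer than N tokens contributes no N-gram.
    Returns {K: [count for N=1, ..., count for N=max_n]}.
    """
    seen = {n: set() for n in range(1, max_n + 1)}
    checkpoints = set(ks)
    results = {}
    count = 0
    for count, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                # The last n tokens form the sentence-final N-gram.
                seen[n].add(tuple(tokens[-n:]))
        if count in checkpoints:
            results[count] = [len(seen[n]) for n in range(1, max_n + 1)]
    # Any K larger than the corpus reports the totals for the whole corpus,
    # which is why the last rows of such a table can be identical.
    totals = [len(seen[n]) for n in range(1, max_n + 1)]
    for k in ks:
        if k > count:
            results[k] = totals
    return results
```

Called with an iterable of sentence strings, the function returns one row per K; requesting a K beyond the number of available sentences simply repeats the whole-corpus totals.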


Zipf's diagram for sentence endings

[Figure: Gnuplot diagram]
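
The data behind a Zipf diagram is a rank-frequency list: items sorted by descending frequency, with rank 1 for the most frequent. The sketch below computes such a list for sentence-final words. It is an illustration under stated assumptions, not the procedure used for the diagram here: the function name `zipf_rank_frequency` and whitespace tokenization are hypothetical choices.

```python
from collections import Counter


def zipf_rank_frequency(sentences):
    """Return (rank, frequency) pairs for sentence-final words.

    Assumption: the final word is the last whitespace-separated token;
    empty sentences are skipped.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if tokens:
            counts[tokens[-1]] += 1
    # Sort frequencies in descending order; rank 1 is the most frequent ending.
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))
```

Writing the pairs to a two-column file and plotting them in Gnuplot with `set logscale xy` would give the usual log-log view, in which a Zipf-like distribution appears as an approximately straight line.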

2200 msec needed at 2018-04-28 00:01